Spoken English Intelligibility Remediation with PocketSphinx Alignment and Feature Extraction Improves Substantially over the State of the Art

نویسندگان

  • Yuan Gao
  • Brij Mohan Lal Srivastava
  • James Salsman
چکیده

Automatic speech recognition is used to assess spoken English learner pronunciation based on the authentic intelligibility of the learners’ spoken responses determined from deep neural network (DNN) model predictions of transcription correctness. Using numeric features produced by PocketSphinx alignment mode and many recognition passes searching for the substitution and deletion of each expected phoneme and insertion of unexpected phonemes in sequence, the DNN models achieve 97% agreement with the accuracy of Amazon Mechanical Turk crowdworker transcriptions, up from 75% reported by multiple independent researchers. Using such features with DNN prediction models can help computer-aided pronunciation teaching (CAPT) systems provide intelligibility remediation. We have developed and published free open source software so that others can use these techniques.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature extraction of hyperspectral images using boundary semi-labeled samples and hybrid criterion

Feature extraction is a very important preprocessing step for classification of hyperspectral images. The linear discriminant analysis (LDA) method fails to work in small sample size situations. Moreover, LDA has poor efficiency for non-Gaussian data. LDA is optimized by a global criterion. Thus, it is not sufficiently flexible to cope with the multi-modal distributed data. We propose a new fea...

متن کامل

جاسازی خط ویژگی وزن‌دار برای استخراج ویژگی تصاویر ابرطیفی

One of the most preprocessing steps before the classification of hyperspectral images is supervised feature extraction. Because obtaining the training samples is hard and time consuming, the number of available training samples is limited. We propose a supervised feature extraction method in this paper that is efficient in small sample size situation. The proposed method, which is called weight...

متن کامل

Feature reduction of hyperspectral images: Discriminant analysis and the first principal component

When the number of training samples is limited, feature reduction plays an important role in classification of hyperspectral images. In this paper, we propose a supervised feature extraction method based on discriminant analysis (DA) which uses the first principal component (PC1) to weight the scatter matrices. The proposed method, called DA-PC1, copes with the small sample size problem and has...

متن کامل

EFL Pronunciation Teaching: A Theoretical Review

This study aims to represent the developing status of pronunciation teaching and presents the current perspectives on pronunciation learning and teaching, coupled with innovative approaches and techniques/activities. It is argued that pronunciation teaching methodologies have changed over decades since the Reform Movement. The exact status of teaching pronunciation appeared first in the Audio L...

متن کامل

Contourlet-Based Edge Extraction for Image Registration

Image registration is a crucial step in most image processing tasks for which the final result is achieved from a combination of various resources. In general, the majority of registration methods consist of the following four steps: feature extraction, feature matching, transform modeling, and finally image resampling. As the accuracy of a registration process is highly dependent to the fe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1709.01713  شماره 

صفحات  -

تاریخ انتشار 2017